Open Resources for Language Technology
نویسندگان
چکیده
NLPFARM is an Open Source code repository for development and sharing of language technology resources. NLPFARM hosts a number of projects covering various language technology needs, providing possibilities to develop more robust and well-formed applications. NLPFARM has been in use for more than a year and our experience is that it has facilitated co-operation and sharing of resources but that there are still issues to consider.
منابع مشابه
The META-SHARE Language Resources Sharing Infrastructure: Principles, Challenges, Solutions
Language resources have become a key factor in the development cycle of language technology. The current prevailing methodologies, the sheer number of languages and the vast volumes of digital content together with the wide palette of useful content processing applications, render new models for managing the underlying language resources indispensable. This paper presents META-SHARE, an open re...
متن کاملDeveloping Punjabi Morphology, Corpus and Lexicon
We describe an implementation of morphology, development of a corpus and building of a lexicon for Punjabi language. Such resources are building blocks for various language technology tasks ranging from part of speech tagging to machine translation. Their importance is further increased by the fact that Punjabi is an under resourced language. We release these resources as open-source.
متن کاملCreation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
The META-NORD project has contributed to an open infrastructure for language resources (data and tools) under the META-NET umbrella. This paper presents the key objectives of META-NORD and reports on the results achieved in the first year of the project. META-NORD has mapped and described the national language technology landscape in the Nordic and Baltic countries in terms of language use, lan...
متن کاملMETA-SHARE: One year after
This paper presents META-SHARE (www.meta-share.eu), an open language resource infrastructure, and its usage since its Europe-wide deployment in early 2013. META-SHARE is a network of repositories that store language resources (data, tools and processing services) documented with high-quality metadata, aggregated in central inventories allowing for uniform search and access. META-SHARE was devel...
متن کاملLanguage Technology for Language Communities: An Overview based on Our Experience
IXA is a research group that has been working on language technology, mainly on Basque, during the last 28 year. As a result of years of collaboration with the Basque community and communities related to other languages we conclude that Language Technology to be an important factor for language development, previously (or in parallel) an initial core work is needed: 1) standardization and 2) ge...
متن کامل